Generalized Entropy approach to far-from-equilibrium statistical mechanics. 
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We present a new approach to far-from-equilibrium statistical mechanics, based on the concept 
of generalized entropy, which is a microscopically-defined generalization of Onsager-Machlup func- 
tional. In the case when a set of slow (adiabatic) variables can be chosen, our formalism yields a 
general form of the macroscopic evolution law (Generalized Langevin Equation) and extends Fluc- 
tuation Dissipative Theorem. It also provides for a simple understanding of recently-discovered 
Fluctuation Theorem. 
PACS numbers: 05.70.Ln, 05.20.-y 
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Foundations of non-equilibrium statistical physics re- 
main in focus of intensive research for nearly a century. 
Over past several decades, there has been a dramatic 
progress in application of stochastic equations to wide 
variety of complex systems, as well as in understanding 
some of their generic features. However, the methods 
of microscopic derivation of such coarse-grained descrip- 
tions (e.g. starting with a Hamiltonian), remain essen- 
tially limited to traditional kinetic theory and linear re- 
sponse scheme jjj 0. In this work, we emphasize the 
reductionistic mission of non-equilibrium statistical me- 
chanics, by building a constructive formalism whose con- 
ceptual framework resembles that of equilibrium theory. 
Our approach is not limited to the vicinity of thermal 
equilibrium, and becomes equivalent to the classical lin- 
ear response theory in this limit. 

The central concept in our discussion is Generalized 
Entropy (GE), which goes back to works by Onsager and 
Machlup H H. Further development of this paradigm 
includes its generalization for far-from-equilibrium case 
by Graham M and action-principle approach to Mar- 
covian stochastic dynamics by Eyink |^|3), whose tech- 
nique and conclusions have many common points with 
ours. The distinct feature of the present approach is an 
explicit microscopic interpretation of GE, which enables 
us to derive the macroscopic evolution equations starting 
from a microscopic level. Normally, entropy is assigned 
to a macroscopic state of a system as the logarithm of 
its statistical weight. Similarly, we introduce GE, as a 
logarithm of the statistical weight of a given macroscopic 
evolution (trajectory). We limit ourselves to the class 
of microscopic dynamics (parameterized with microvari- 
ables qj, j — 1...N) which are deterministic and phase- 
volume-preserving (as e.g. in Hamiltonian systems). In 
this case, there is a natural measure in the space qj, i.e. 
the statistical weight of any subset of the space is its 
phase volume. Let A = A(g) be a macroscopic (possi- 
bly, multicomponent) variable. The formal definition of 
the GE associated with a given evolution A(t) between 
two moments of time, U and tf, is the following: 



S[A(i)#=log 



z%] n s (A(«(t)) - aw) 



(1) 



Here, integration is performed over all actual microscopic 
trajectories qj(t), i.e. those satisfying the microscopic 
equations of motion. Since we assume the deterministic 
and phasc-volume-preserving evolution on that level, all 
the actual trajectories are equally weighted. 

Our next step is to distinguish between two contribu- 



tions to GE: S [A(i)]£ = S ( ° ] t (A) - 

first term, is the logarithm of the total number of 
trajectories of length At = t/ — f, with a fixed " midpoint" 
A = (A tj + A t/ ) /2. The other contribution, S( km \ to 
which we will refer as kinetic entropy, is the logarithm of 
the probability of a given evolution A(t) for the fixed U, 
tf and A: 



kin ) 



A(t) 



The 



S ( km) ^ A(t) j «' = l g ( J (j t A(q(t)) - A(t) 



\t=n 



log / £>[X(t)]exp S[X(f)] 



X(t) ■ A(t)dt . (2) 



Here we have transformed the expression by using the 
exponential representation of (^-function, which resulted 
in introduction of a new variable X conjugated to A. E 
is the generating functional for the variable A, which in 
particularly means that its variation of any order in X 
coincides with the corresponding irreducible correlator of 
the conjugated field (in our case, A) : 



-|X!/; ::!<>, (,xj>! / ' X(<) • 4%, ) )_ = ^ 



dt 



E- / dt 1 ...dt n ^A(t 1 )...A(t n )))_-X(t 1 )...X(t n ). 

n=l U ' J ' A 

Here ((}} is the irreducible part of the correlator, 
i.e. the n-point average with subtracted contribution re- 
ducible to the lower-order correlation functions. Here 
and in the future A(ti)A(i2) denotes direct tensor prod- 
uct, which is to be distinguished from scalar one, e.g. 
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X(t) • A(t). The averaging is performed over all trajec- 
tories with the given initial and final times (ti, tf) and 
fixed midpoint, A. 

Now, after we have related kinetic entropy S , to 
the statistics of A, it becomes possible to express an- 
other contribution to GE, 5(°)(A), in terms of regular 
thermodynamic entropy S(A). For doing so we notice 
that integration of the statistical weights of all trajecto- 
ries originating from a given point A is, by definition, 
the weight of the initial state, expS(A). After making 
a simple calculation based on this observation, one gets 
5(A) = S(°)(A) + E [§<5S(°tyaA] . The time interval At, 
and the associated change of A is assumed to be suffi- 
ciently small so that the linear expansion of in AA 
be valid. 

In a general case, E-functional is an awkward mathe- 
matical object, because of its non-local structure. How- 
ever, until this point we have not restricted the choice 
of the macroparameters, A. From the practical point of 
view, it is clear that the coarse-grained description of a 
system may be reasonable if one can choose a set of rela- 
tively slow variables as the macroparameters. Below we 
specify this choice in a more formal way. 

By definition, the form of E-functional (and the cor- 
relators of A) depends on the midpoint position, A. 
However, there naturally exists an interval <5A within 
which such dependence can be neglected. This allows 
one to introduce a concept of drift time, Thrift, over 
which most of the trajectories remain within this inter- 
val of constant statistics of A. Suppose there exists a 
shorter time scale, r <C Tdrift such that any correlator 
((A(ti)...A(t n ))) becomes negligible when 1^ — t n \ > t . 
In this case, one can choose the initial and final times 
such that tq *C At = tf — ti <C Tdri ft ■ This considerably 
simplifies the expression for E-functional: if we are only 
interested in the behavior of the system on times larger 
than tq, E becomes local in X: 



srx(t)] 



S(X(t))dt, 



here 



X" 



dii...dt„_ 1 ((A(0)...A(t n _i) 



(4) 



(5) 



We will refer to time scale To as ergodicity time. It can be 
shown that from the methodological point of view, the 
assumption of the existence of such time scale does play a 
role similar to that of the ergodic principle in equilibrium 
theory. 

Collecting the above results gives the following expres- 
sion for GE: 

S[A(i)# =5(A(t i ))+log J D[X(t)]expS'[A(t),X(t)}, 

(6) 



The first contribution here is the conventional entropy 
of the initial state, and the other one is the logarithm 
of the probability of a given evolution starting at that 
point. The latter is expressed in terms of the functional 
S' , which has a meaning of GE in (A, X) space: 



S'[A(t),X(t)} = 



dt 



A.|i£-X 
2 5A 



(7) 



This quantity becomes additive on the time scales exceed- 
ing To, which means that the dynamics becomes Marko- 
vian. This fact allows us to extend the applicability of 
the above expression to the case when tf — ti > Tdrift, 
i.e. when S(X) is no longer independent of A. 

In a particular case of reversible microscopic dynamics, 
S does not change if the sign of X reversed. This implies 
that the ratio of probabilities of direct and reversed evo- 
lutions along the same path A(t) is independent of the 
form of E-functional and is given by exp(A(ty) — A(ti) 
(see This property is known as Fluctuation The- 

orem ( B[ , which has been recently established for a wide 
variety of non-equilibrium systems. 

One can eliminate the "fictitious" variable X from Eqs. 



s[A(t)]l , = S(A(t i)) + S(A { t f )) 



(8) 



dt 



A 



(AW) - 5 ' 



2 5A 



here the kinetic entropy, which is now a local func- 
tional, has been expressed as a time integral of pseudo- 
Lagrangian A: 



A(t) 



A(A(t))dt = (9) 



log / D[X(t)] exp J dt S(X(<)) - X(f) • A(t) 



Because of the obvious analogy with classical mechan- 
ics, we will refer the above (A, X) and (A, A) forms for 
GE as pseudo-Hamiltonian and pseudo-Lagrangian ones, 
respectively. Though they are completely equivalent, the 
pseudo-Hamiltonian formalism requires introduction of 
additional variables X (which plays the role of momen- 
tum conjugated to measurable A), while the pseudo- 
Lagrangian form obscures the relationship between the 
conjugated functions A and S, given by Eq.(||). 

In the case of a distributed system, when both A and 
X are fields, A and 5 would typically become local func- 
tionals. If there is a global conservation law for one 
or several components of A, the E-functional, Eq. (||) 
is invariant with respect to the global transformation 
X(r) -> X(r) + 2 a 5 (a) nW (here index a counts all 
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the conserved components of field A, n^) • A). Exis- 
tence of this transformation means that E and 3 should 
depend only on gradients (and higher spatial deriva- 
tives) of the corresponding components of the field X: 
£ = / £(VX^)dtdr. If the microscopic fluxes of 
the conserved parameters can be introduced, the expan- 
sion of £ in powers of VX'"' is given by the form similar 
to Eq. (||), with all the correlators of A replaced with 
those of microfluxes j a . 

We now discuss the dynamics of the system in the de- 
terministic limit, which correseponds to the settle point 
of the GE functional. One has to emphasize that al- 
though the structure of the functional is similar to reg- 
ular action, the result of the variation procedure is dra- 
matically different from that in mechanics. In our case, 
only initial point A(i$) should be kept fixed, while the 
initial rate A(ij) (or, equivalently, the final point A(ty )), 
is subject to optimization. As a result, the current value 
of A, rather than the pair (A, A) determine the future 
dynamics of the system, and the equation of motion is 
of the first, rather than the second order. Namely, maxi- 
mization of the pseudo-Lagrangian form of GE, Eq. (||), 
with respect to A(tf), yields the following dynamic law: 



SA(A) 
SA 



ISS_ 
2SA' 



(10) 



This equation can be interpreted as a balance between 
thermodynamic driving force (the right hand side) and 
the dissipative force (the apparent physical meaning of 
the left hand side). An alternative description can be 
obtained by variation of the functional in Eq.(Q) with 
respect to A and X: 



SE 
SX 



X=i5S/5A 



(11) 



The equation shows how the system moves in response 
to the thermodynamic driving force, which in the deter- 
ministic limit appears to be identical to variable X. 

In the case of conserved components of A-field, Eq. 
(|Tf|), will be replaced with regular continuity equation, 
= -V • J (Q \ in which the macroscopic flux is given 
by constitutive equation, JW = S£/6(VX a ). It is in- 
teresting to note that the macroscopic flux can be in- 
troduced even if there is no well-defined fluxes on micro- 
scopic scale: this concept follows from the spatial locality 
of S-functional and conservation of quantity J A^ dr. 

"Kinetic potentials" A and 5 in the deterministic limit 
are related through Legander transform, i.e. 3(X) = 
A(A) + X • A, where X = -6A(A)/6A. In the vicinity 
of equilibrium the driving forces are small, and there- 
fore only leading terms in expansion (j|) remain relevant: 
S= (A)-X+ir' 2 '-X 2 . After substituting this expression 
for 3 into equation of motion ([TT]) , we recover the clas- 
sical linear response result: A = (A) + ■ SS/SA, 



i.e. the dissipative contribution to A is proportional 
to the thermodynamic driving force, and our result for 
the corresponding kinetic coefficient (see @) coincides 
with the one given by Fluctuation-Dissipative Theorem 
(FDT). It is a general practice to assume the same lin- 
ear rate-force relationship even in the regime in which 
entropy (free energy) is no longer a harmonic function 
of deviations from equilibrium. Langevin equation 
is one of the most noticeable examples of such an ap- 
proach. Furthermore, Ottinger et al || have recently 
proposed an elegant unified way of representing most of 
the known stochastic models in a single generic form, 
which again assumes a linear rate-force relationship for 
dissipative dynamics. Our Eq. (11) (or (|Io|)) in a gen- 
eral case would result in a nonlinear relationship between 
them and, in this sense, can be referred to as General- 
ized Langevin Equation (GLE) (an additional noise term 
will be discussed below). The fact that a particular form 
of the evolution equation depends on the correlators of 
macroparameters, suggests a possibility for a synergy be- 
tween our scheme and earlier field theoretical approaches 
to non-equilibrium statistical mechanics pc[ . 

In order to demonstrate how GLE works outside the 
linear regime, we consider a trivial kinetic problem: 
an ensemble of independent two-state systems, each of 
which has the same transition rate k in either direc- 
tion. The relaxation dynamics for the population dif- 
ference between the two states, A_ = Ni — N 2 is given 
by equation A_ = — 2kA_. Although linear, it is not 
a linear response result. The thermodynamic driving 
force conjugated to A_ is the chemical potential dif- 
ference between the two states, i.e. in our notations 
2A_ = dS/dN- = log N 2 /N 1 . This expression can be 
linearized in A_ (for constant N + = N\ + N 2 ) only suf- 
ficiently close to equilibrium, i.e. when 7V_ <C N + . This 
means that the simple linear kinetic equation is a result 
of a non-linear dependence of the response A_ on the 
driving force X-. In the considered case, one can calcu- 
late S(A_) exactly. In the limit of large N+, the number 
of switches happening over small time 5t is N + n5t. Since 
all the switches are completely uncorrelated, the original 
formula for ^-potential, Eq. (|^), results in the following 
expression for 5(A): 3 = A + Klog[cosh(2A_)]. Here we 
have taken into account the fact that the change of popu- 
lation difference, 5N_ is either 2 or —2 for any individual 
switch and the both possibilities are equally weighted. In- 
deed, any microscopic trajectory (sequence of individual 
switches) can be reversed, and this does not change the 
position of its midpoint, AC = (N-(ti) + N-(tf))/2. Af- 
ter using Eq. (^) , we recover the expected linear equation 
for N-. 

We now proceed with the discussion of fluctuations 
around the deterministic dynamics. Let A!°'(t) be a so- 
lution to equation of motion (|l(]), and a(i) = A(t) — 
A(°'(t) is the deviation of an actual trajectory from it 
(a(tj) = 0). The corresponding deviation of the gener- 
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alized entropy from its local maximum is given by the 
following quadratic expression: 

f tf A -1 / - \ 2 
SS [a(t)] = - j — ■ (a + A • k ■ a J dt = 

-j/ 8 " • (w 2 A _1 +«• ^-k) ' a -"^' ( 12 ) 

here A = £ 2 S/<SX 2 and k = \S 2 S/5A 2 . Note that the de- 
terministic trajectory is stable only if the response matrix 
A • k is positive-definite. Otherwise, any two trajectories 
originating from the same point diverge exponentially 
fast. By definition, exp (SS [a(t)]) is the statistical weight 
of a given trajectory. Therefore, the above quadratic 
functional, Eq. (|l2|), corresponds to Gaussian statistics 

of a, with (a w a_ w ) = (o; 2 A _1 + k ■ A • kj . Equiva- 
lently, this result can be represented by introduction of 
a random Gaussian noise rj(t) to the second equation of 
(^), with (r ] (t)r ) (t')} = X8(t-t'). 

The fact that the same matrix S 2 E/SX 2 controls both 
the strength of the fluctuations and the response to a 
small variation in driving force (see ( |l2] ) or (|TT|)), is a 
signature of FDT jjj [^) , which is conventionally applied 
only in linear-response regime. In order to extend FDT to 
our far-from-equilibrium case, we probe the system with 
time-dependent perturbation introduced as an addition 
to entropy: S(A,t) = So (A) + h(i) • A. In a particu- 
lar case of a Hamiltonian system with fast coupling to a 
thermal bath, h is proportional to field h' canonically- 
conjugated to A: h = —j3h!. The perturbation results 
in adding a coupling term ^ w iwh_ w a w /2 to functional 
SS, Eq.(|l2|). The response now can be determined by 
variation of SS with respect to a w . An important aspect 
of this procedure is that the causality principle should 
be taken into account: in ^representation, a(i) should 
be varied with the specified past and unknown future (as 
we did while deriving equations of motion, (|Io| ) and (0)). 
In lu representation, the response is given by 

r M = ^— = y(a w a_ w ) TC . (13) 

Here index TZ stands for the retarded part of the cor- 
relator, i.e. the one with all poles at > half- 
plane. The above relationship extends the classical FDT 
towards strongly non-equilibrium regime. It has to be 
stressed that T(uj) does determine the response to small 
perturbations but, in contrast to linear response regime, 
it does not relate the total driving force SS/S~K. to the 
evolution rate A (the relationships is given by non-linear 
Eq. ©or®). 

Surprisingly enough, the applicability of FDT might 
be extended even further, to time scales comparable or 
shorter than ergodicity time To. Namely, if the frequen- 
cies of interest are considerably higher than the charac- 
teristic relaxation rates (eigenvalues of relaxation matrix 



A ■ k) , we may neglect the dependency of the the driving 
force on a. In this case, the generalized entropy in (a, x)- 
representation can be written in the following form: 

S' = <r[x(t)] a[h(t)/2] + J*' a ■ - x(i)) dt. 

(14) 

Here x(t) is the deviation of X from its deterministic 
value X , and cr[x(t)] = S[X +x(£)] — S[X ] is the cor- 
responding deviation of E-functional, which is no longer 
assumed to be local, i.e. uitq is not small. In this regime, 
to which one may refer as sub-ergodic, the statistics of a 
need not to be Gaussian. 

It is easy to show that cr[x(t)] is the generating func- 
tional for a, i.e. its variations coincide with the corre- 
sponding correlators of &(t). On the other hand, in ac- 
cordance with |l4|), these variations of a determine the 
response of the system in any order of h, (u)i...u> n ) = 
S n B.-s Uk /Sh Ul ...Sh Un . This results in the following exten- 
sion of FDT to sub-ergodic time scales: 

n 

fW( Wl ... Wn ) = ((a_ a ,»a Ul ...a Un )) K J[ ^. (15) 

fe=i 

This version of FDT is remarkable: (i) it may be appli- 
cable to systems with considerable memory effects, e.g. 
glasses; (ii) it establishes the relationship between the 
non-linear response of a system and the deviation of its 
fluctuation statistics from Gaussian. The above rela- 
tionship is very similar to recent results for Markovian 
stochastic processes, ||. It should be noted that the di- 
rect experimental or numeric check of Eq. ( |l5| ) may be 
difficult to perform without correct interpretation of field 
h (its physical meaning is straightforward only in the 
case of ideal (fast) thermal bath coupled to the system). 
In particular, such interpretation may involve frequency- 
dependent temperature 

In conclusion, we have proposed a framework for 
construction of non-equilibrium macroscopic theory of 
a complex system, starting with its fundamental non- 
dissipative dynamics. This approach assumes the pos- 
sibility of choosing a set of relatively slow (adiabatic) 
variables. Our major results include the general form of 
equation of motion of the system under a given thermo- 
dynamic driving force (GLE) and extended FDT. Among 
the immediate possible applications of our scheme is 
the development of non-equilibrium statistical theories 
of various complex systems starting with their model 
Hamiltonians, such as Heisenberg and XY models j^| 
or Gross-Pitaevsky model of Bose condensate. An- 
other intriguing direction of the development of the gen- 
eralized entropy approach is its use for Landau-type 
description of bifurcations. It also provides us with 
an apparatus to study the problem of kinetic tunnel- 
ing between various steady states (attractors) of a non- 
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equilibrium system. A natural extension of our theory 
would be its quantum generalization. 
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